gpALIGNER: A Fast Algorithm for Global Pairwise Alignment of DNA Sequences

Authors

Mostafa Hadian Dehkordi

Ali Masoudi-Nejad

Morteza Mohamad-Mouri

Abstract

With full genomes now sequenced for many species, bioinformatics increasingly relies on efficient global alignment tools that offer both high sensitivity and high specificity. Many computational algorithms have been applied to the sequence alignment problem; dynamic programming, statistical methods, approximation algorithms, and heuristic algorithms are the most common. We introduce gpALIGNER, a fast pairwise DNA-DNA global alignment algorithm. gpALIGNER uses a scoring scheme similar to that of DIALIGN-T to produce the final alignment. It also uses the concept of "spaced seeds" to find locally aligned subsequences, which are assembled into a semi-global alignment that serves as a preliminary step for computing the global alignment. This gives gpALIGNER the precision of the DIALIGN-T algorithm with considerably lower time and space complexity. We benchmarked our approach on numerous datasets from standard benchmarking databases and on real sequences from the NCBI database, where gpALIGNER ran three times faster than DIALIGN-T. gpALIGNER is thus a new alternative that offers the sensitivity and selectivity of DIALIGN-T at a lower computational cost.
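The "spaced seeds" idea mentioned in the abstract can be illustrated with a minimal sketch. This is not gpALIGNER's implementation: the function name `spaced_seed_hits` and the seed pattern `"1101"` are hypothetical choices made only for illustration. A spaced seed is a pattern of "care" positions (`1`) and "don't care" positions (`0`); two windows count as a hit when they agree at every care position, so a mismatch at a don't-care position is tolerated.

```python
from collections import defaultdict


def spaced_seed_hits(a, b, pattern="1101"):
    """Return (i, j) pairs where a[i:] and b[j:] agree at every '1' of pattern.

    Illustrative only: the pattern is an assumed example, not the seed
    used by gpALIGNER.
    """
    care = [k for k, c in enumerate(pattern) if c == "1"]
    span = len(pattern)

    # Index every window of the first sequence by its care-position letters.
    index = defaultdict(list)
    for i in range(len(a) - span + 1):
        key = "".join(a[i + k] for k in care)
        index[key].append(i)

    # Look up every window of the second sequence in that index.
    hits = []
    for j in range(len(b) - span + 1):
        key = "".join(b[j + k] for k in care)
        for i in index.get(key, ()):
            hits.append((i, j))
    return hits
```

For example, `spaced_seed_hits("ACGTAC", "ACTTAC")` reports a hit at `(0, 0)` even though the two sequences differ at position 2, because that position falls on the `0` of the pattern. In a seed-based aligner, hits like these would then be extended and chained into longer local matches; that later stage is not shown here.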


Similar resources


Minimap2: fast pairwise alignment for long DNA sequences

Motivation: Recent advances in sequencing technologies promise ultra-long reads of ∼100 kilobases (kb) on average, full-length mRNA or cDNA reads in high throughput and genomic contigs over 100 megabases (Mb) in length. Existing alignment programs are unable to process such data at scale or do so inefficiently, which calls for the development of new alignment algorithms. Results: Minimap2 is a gene...


A Fast Algorithm for Exonic Regions Prediction in DNA Sequences

The main purpose of this paper is to introduce a fast method for gene prediction in DNA sequences based on the period-3 property in exons. First, the symbolic DNA sequences are converted to a digital signal using the EIIP method. Then, to reduce the effect of background noise in the period-3 spectrum, we use the discrete wavelet transform (DWT) at three levels and apply it on the input digital sig...


Net2Align: An Algorithm For Pairwise Global Alignment of Biological Networks

The amount of data on molecular interactions is growing at an enormous pace, whereas the progress of methods for analysing this data is still lagging behind. This is a potential problem particularly in the comparative analysis of biological networks, where one wishes to explore the similarity between two biological networks. In consideration that the functionality primarily runs at...


FOGSAA: Fast Optimal Global Sequence Alignment Algorithm

In this article we propose a Fast Optimal Global Sequence Alignment Algorithm, FOGSAA, which aligns a pair of nucleotide/protein sequences faster than any optimal global alignment method including the widely used Needleman-Wunsch (NW) algorithm. FOGSAA is applicable for all types of sequences, with any scoring scheme, and with or without affine gap penalty. Compared to NW, FOGSAA achieves a tim...


A Fast Algorithm for Exonic Regions Prediction in DNA Sequences

The main purpose of this paper is to introduce a fast method for gene prediction in DNA sequences based on the period-3 property in exons. First, the symbolic DNA sequences were converted to a digital signal using the electron ion interaction potential method. Then, to reduce the effect of background noise in the period-3 spectrum, we used the discrete wavelet transform at three levels and applie...


Journal:
Iranian Journal of Chemistry and Chemical Engineering (IJCCE)

Publisher: Iranian Institute of Research and Development in Chemical Industries (IRDCI)-ACECR

ISSN 1021-9986

Volume 30

Issue 2, 2011
